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Einstein Telescope (ET) is conceived to be a third generation gravitational-wave observatory. Its 
amplitude sensitivity would be a factor ten better than advanced LIGO and Virgo and it could 
also extend the low-frequency sensitivity down to 1-3 Hz, compared to the 10-20 Hz of advanced 
detectors. Such an observatory will have the potential to observe a variety of different GW sources, 
including compact binary systems at cosmological distances. ET's expected reach for binary neutron 
star (ENS) coalescences is out to redshift z :^ 2 and the rate of detectable ENS coalescences could 
be as high as one every few tens or hundreds of seconds, each lasting up to several days. With 
such a signal-rich environment, a key question in data analysis is whether overlapping signals can 
be discriminated. In this paper we simulate the GW signals from a cosmological population of ENS 
and ask the following questions: Does this population create a confusion background that limits 
ET's ability to detect foreground sources? How efficient are current algorithms in discriminating 
overlapping ENS signals? Is it possible to discern the presence of a population of signals in the data 
by cross-correlating data from different detectors in the ET observatory? We find that algorithms 
currently used to analyze LIGO and Virgo data are already powerful enough to detect the sources 
expected in ET, but new algorithms are required to fully exploit ET data. 

PACS numbers: 



I. INTRODUCTION 

After a decade of detector installation and commis- 
sioning, ground-based detectors looking for gravitational 
waves (GWs) have reached or surpassed their design sen- 
sitivities and are poised to open up a new window onto 
the Universe, as well as allowing coincident searches with 
electromagnetic or neutrino detectors. The first genera- 
tion of interferometric observatories (GEO [1 , LIGO [2 
and Virgo [3]) have already put interesting constraints, 
for example, on the ellipticity of the Crab pulsar and 
on the cosmological stochastic background [5 . With the 
second generation starting in a few years, one expects 
to detect compact binary coalescences in the local Uni- 
verse [6], while third generation detectors, such as the 
Einstein Telescope [7 , should take GW astronomy to a 
new level, due to the large numbers of high SNRs of de- 
tectable sources, making it possible to address a range of 
problems on a wide variety of astrophysical sources but 
also in fundamental physics and cosmology. 

The coalescence of two neutron stars (ENS), two black 
holes (BBH) or a neutron star and a black hole (NS-BH), 
are the most promising sources for terrestrial detectors, 
due to the huge amount of energy emitted in the last 
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phase of their inspiral trajectory, merger, and ringdown. 
The maximum distance probed with current detectors is 
about 30Mpc [8 for ENS, but the next generation of de- 
tectors should be taking data with a sensitivity approxi- 
mately 10 times greater, pushing the horizon up to about 
450 Mpc [6]. With the third generation Einstein Tele- 
scope, the sensitivity will be increased by another order 
of magnitude and the horizon of compact binaries is ex- 
pected to reach cosmological distances [9]. Among other 
things, this will allow for a detailed study of the evolution 
of binary coalescences over redshift [10 , measurement of 
the mass function of neutron stars and black holes and 
of the neutron star equation of state [11] [12], and the 
use of binary neutron stars and neutron star-black hole 
binaries as standard sirens to constrain dark energy and 
its time evolution [TSHIS] (for a summary of ET science 
objectives see Ref. [9]). In such a large volume, however, 
the number of sources can be as large as a million and the 
waveforms may overlap to create a confusion foreground, 
especially at low frequencies where the signal can last for 
several days [TT. This could affect our ability to make 
individual detections and perform parameter estimation, 
and the issue deserves thorough study. 

With this in mind, we have simulated Einstein Tele- 
scope detector noise and added signals from a popula- 
tion of compact binaries, with a view to issuing a Mock 
Data Challenge (MDC) to the gravitational-wave com- 
munity. This could be used to develop advanced data 
analysis methods in order to separate the sources and 
measure the properties of both individual sources and of 
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the catalog as a whole. Initially we used a simple BNS 
inspiral signal model, but work is in progress to include 
other types of sources and more sophisticated scenarios. 
In the longer term we envisage issuing ET science chal- 
lenges to encompass not only detection of signals and 
parameter estimation, but also the application of such 
results to outstanding problems in fundamental physics, 
astrophysics and cosmology. 

In Section 2, we present the Einstein Telescope; in Sec- 
tion 3 we describe our procedure to simulate the mock 
data; in Section 4 we present the results of the search 
for both individual sources and the integrated signal; in 
Section 5 we discuss future developments for the mock 
data and in the search methods; finally in Section 6 we 
draw our conclusions. 



II. EINSTEIN TELESCOPE 

A conceptual design study was recently concluded for 
the proposed European project, the Einstein gravita- 
tional wave Telescope (ET) [18^. The goal of the study 
was to explore the technological challenges and the scien- 
tific benefits of building a third generation gravitational 
wave detector that is a factor 10 better than advanced de- 
tectors but also capable of observing at frequencies down 
to 1-3 Hz [7 . In this Section we will discuss the optical 
configuration of ET, different design choices for its sen- 
sitivity and ET's response to gravitational waves and its 
distance reach to compact binary coalescences. 



A. Optical topology and sensitivity 

Consideration of many factors including continuous ob- 
servation (duty cycle), ability to resolve the two polariza- 
tions of GW, and capacity to support new designs over 
many decades, led to the conclusion that the infrastruc- 
tures housing the current detectors will be inadequate to 
meet the design goals of ET. The Einstein Telescope is 
envisioned to consist of three V-shaped Michelson inter- 
ferometers with 60 degree opening angles, arranged in 
a triangle configuration (see Fig. [l] left panel). These 
are to be placed underground to reduce the influence of 
seismic noise. 

The design goal to push the sensitivity floor at low 
frequency down to 1-3 Hz comes from the requirement 
that ET be sensitive to more massive coalescing binaries 
than advanced detectors, i.e. intermediate mass BBH of 
masses in the range 10^-10^ Mq [19 - 22 , but also be able 
to observe stellar mass binaries for far longer periods be- 
fore they merge. With better low- frequency sensitivity, 
the subtle secular general-relativistic effects that occur 
in strong gravitational fields will accumulate over longer 
periods, as shown in Eq. (36), facilitating a deeper un- 
derstanding of GW sources. Additionally, in the case 



source's redshift to be measured [T5^, by breaking the de- 
generacy between the redshifted mass measured from the 
PN phase and the rest-frame mass measured from the NS 
tidal deformation phase. In [15 a lower frequency cutoff 
of 10 Hz was used; if this cutoff is reduced to 3 Hz, im- 
proving the accuracy of parameter estimation, the errors 
on recovered redshift are reduced by tens of percent, up 
to nearly a factor 2 improvement for sources at redshift 
4 [16^. 

As the understanding of the achievable sensitivity for 
Einstein Telescope evolved during the Design Study, dif- 
ferent sensitivity curves were considered. Early in the 
study the possibility was envisaged of pushing the low 
frequency limit down to 1-3 Hz in a single interferometer, 
while still achieving a broad-band improvement of an or- 
der of magnitude in sensitivity over Advanced detectors 
[23] . However, this is highly challenging, and perhaps 
technically unfeasible, since the technology that achieves 
better high frequency (> 100 Hz) sensitivity, i.e. higher 
laser power to bring down the photon shot noise, is in di- 
rect conflict with that required to improve low frequency 
(< 100 Hz) sensitivity, i.e. lower thermal noise and ra- 
diation pressure noise, since these are both worsened by 
higher laser power. 

Another design subsequently considered is the so-called 
xylophone configuration [24 . The idea is to install two 
interferometers in each V of the triangle, one that has 
good high-frequency sensitivity and the other with good 
low- frequency sensitivity [24^ 25^. Several other new tech- 
nologies, for instance frequency-dependent squeezing of 
light, have been studied in detail for the ET design [18] 
and must be combined to achieve the sensitivity goals of 
third generation detectors [26]. 

The main design parameters for ET to achieve a factor 
10 improved sensitivity over advanced detectors, while 
also achieving good sensitivity in the 3-10 Hz region, are 
as follows: 10 km arm lengths, 500 W of input laser and 
3MW of arm cavity power for the high frequency in- 
terferometer, and 3W of input laser and 18 kW of arm 
cavity power and the use of cryogenic technology (mir- 
rors cooled to 10 K), for the low frequency interferometer 
[T8] [25] . Fig. [l] right panel, compares the sensitivity 
of the initial single-interferometer configuration (ET-B) 
[2J with the xylophone configuration (ET-D) [25 which 
was the latest and most evolved design.^ Also plotted for 
comparison are the design sensitivity curves of advanced 
LIGO (high power, zero detuning: 'aLIGO') ^27j and ad- 
vanced Virgo ('aVirgo') and initial LIGO [29] and 
Virgo |30]. 



of binaries where one or both components is a neutron 
star, the improved low-frequency sensitivity will allow the 



^ Note that the low- frequency sensitivity floor of ET-D, compared 
to ET-B, is determined by a more detailed and realistic modelling 
of the suspension |25| . 



3 




FIG. 1: Left : Schematic configuration of the planned GW detector Einstein Telescope. Right : sensitivity for the initial 
configuration, ET-B, considered in the Design Study J^, and the most evolved configuration ET-D, compared to the sensitivity 
of first generation detectors LIGO and Virgo and the projected sensitivity of second generation (advanced) detectors, here the 
aLIGO high-power zero detuning sensitivity [27] and aVirgo [28]. 



B. Response function and antenna pattern 

Let us begin by looking at ET's response to GW sig- 
nals. Far away from a source, gravitational waves emit- 
ted by a system can be expressed in a suitable coordinate 
system as a transverse and symmetric trace-free (STF) 
tensor h'^^ (all temporal components of the metric pertur- 
bation vanish) whose only non-zero spatial components 
are 

/ill = -/l22 = Z,^^ f^l2 ^,^21 ^f^^^ 

Let (e^;, e^, e^) be an orthonormal triad in which the 
metric perturbation takes the transverse-traceless form. 
Then, using basis polarization tensors defined as 

e_l_ = Gj, Gy e^, Gy^ = Gx ^ H~ ^ ^x-) (2) 

the metric perturbation can be written as 

h = /i+e+ + /ix ex- (3) 

ET's interferometers can also be represented as STF ten- 
sors: 

= ^(ei (g) ei - 62 (8) 62), 

= ^(62(8)62 -63(8)63), 

d^ = ^(63(8)63-61(861), (4) 

where 61, 62 and 63 are unit vectors along the three arms 
of ET. The response h^{t)^ A = 1,2, 3, of the interferom- 
eters to an incident gravitational wave is just the inner 



product of the detector tensor d^ with the wave tensor 
h: 

h^{t) = df^ h'^ = df^e^ h+ + df^e'l /ix , (5) 
which motivates definition of the antenna pattern func- 
tions and : 

F^^^eY, F^^44, (6) 
in terms of which the response can be written as 

(t) = df^ h^^ = F^h+ + F^h^. (7) 

Let us now choose a coordinate system fixed to ET 
such that the three arms of ET's triangle are in the xy- 
plane and the unit vectors along the arms are 

ei = ^ (\/3, -1, 0) , e2 = ^ {V3, 1, O) , 
63 = (0, 1, 0). 

(8) 

Let (^, cp) be the direction to the source in this coordinate 
system with {gq^ gq) denoting directions of increasing 
and (p^ respectively. 

The unit vectors g^^ 6^ and 6^ defining the radiation 
frame can be obtained by successive counterclockwise ro- 
tations about the z-axis by an angle (p^ about the new 
?/-axis by an angle and the final z-axis by an angle t/j: 
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= {— ^in (p siml) -\- COS COS (f COS il) , cos (p siml) -\- cos sin (f cos iIj ^ — sin cos t/j) ^ 
By — {— sin (p COS t/j — COS COS (p sinip ^ cos(/:? cos?/; — cos6> sin(/? sin?/;, — sin6> sin?/^), 
= {sinO coscp^ sinO sincp^ cos^), 

where is the polarization angle defined by cosip = bq • e^. The antenna pattern functions of the interferometer 
whose arms are (ei, 62) is: 

F| = -^[(1 + cos^ l9) sin 2(/:) cos 2V^ + 2 cos l9 cos 2(/9 sin 2V;], (9) 

= +^[(1 + cos^ l9) sin 2(/:) sin 2V^ - 2 cos l9 cos 2(/:) cos 2V^]. (10) 



The antenna pattern functions are a factor sin 7 = a/3/2 
smaller than that of an L-shaped detector of the same 
length, where 7 = 7r/3 is the opening angle of ET's in- 
terferometer arms. 

The antenna pattern functions of the other two detec- 
tors in ET, with arms (e2, 63) and (es, ei), are obtained 
from F| and F^, by the transformation (p ^ (p ±27t/3 : 

Fl^i0,ip,i^) = Fl^^{0,ip + 2Tr/3,iP), (11) 
Fl^{e,^,xP) = F|,,(0,^-27r/3,tA). (12) 

Ff and F^ are sometimes called antenna amplitude pat- 
tern functions to distinguish them from their squares 
{Ff')'^ and (F^)^, which are called antenna power pat- 
tern functions [31]. The overall response of an interfer- 
ometer to an incident wave depends on the square root of 
the sum of the antenna power pattern functions F^-\-F^ . 
The joint response of all the three detectors in the ET 
network is 

F2 = ^(F^)2 + (F,^)2, (13) 

A=l 

which can be shown to be equal to 

= (1 + 6 cos^ e + cos"^ e) . (14) 

Thus, the joint antenna power pattern depends only on 
the colatitude of the source. ET's response is smaller 
compared to an L-shaped interferometer by a factor V3/2 
due to the 60° opening angle, but its 3 detectors enhance 
its response by a/3, leading to an overall factor of 3/2. 
This is indeed what we find: F{0) = 3/2. The response 
averaged over is y^{F^) = ^/2/5F{0) 0.63 F(0) and 
its minimum value is F{7t/2) = F{0)/VS 0.35 F(0). 
With an average response 63% of its optimum and a worst 
response 35% of its optimum, and with no null directions, 
ET has virtually all-sky coverage. 



C. Null stream 

It follows immediately from Eqs. Q, ([5| that the sum 
of the individual responses is identically equal to 

zero. The sum of the responses of any set of Michelson 
interferometers forming a closed path is zero and is called 
the null stream. As we shall discuss later, such a null 
stream is an invaluable tool in data analysis. 

Two L-shaped detectors with arm lengths of 7.5 km 
(and total length of 30 km), rotated relative to each other 
by an angle 7r/4, are completely equivalent to ET in terms 
of their response and resolvability of polarizations. How- 
ever, their response cannot be used to construct a null 
stream. 



D. Distance reach to compact binaries 

In 1986 Schutz showed [66 that inspir ailing binary sys- 
tems are standard candles whose (luminosity) distance 
can be measured from the observed gravitational wave 
signal, without the need to calibrate sources at different 
distances. Our detectors are able to measure both the 
apparent and absolute luminosity of the radiation, and 
hence to extract the luminosity distance of such a source: 
the magnitude of the gravitational wave strain gives the 
apparent luminosity but the rate at which the signal's 
frequency changes gives the absolute luminosity. 

For simplicity we shall consider a binary that is lo- 
cated at an optimal position on the sky (overhead with 
respect to the plane of ET) and optimally oriented (i.e. 
its angular momentum is along the line of sight). The 
discussion below holds good even when these assump- 
tions are dropped, but the measurement of the various 
angular parameters would be essential in order to dis- 
entangle the distance. This would require a network of 
three or more detectors with long baselines to triangulate 
the source's position on the sky. We will also only con- 
sider the GW quadrupole amplitude in this discussion; 
higher-order corrections to the amplitude do not affect 
our conclusions on ET's distance reach. 



5 



The magnitude of the strain measured by our detectors 
when the signal frequency reaches the value / is 



m 



2/3 



COS 



/' 

Jo 



f{t')dt' 



(15) 



where c is the speed of light, G the gravitational constant, 
A4 is the chirp mass of the binary, related to its total mass 
M — mi + m2 and symmetric mass ratio u = mim2/M^ 
by = i/^/^M, and D is the proper distance to the 
source. Note that this expression is valid in the limit of 
asymptotically flat, static spacetime; we will soon dis- 
cuss the effect of cosmological expansion on the observed 
signal. 

In addition to the signal's strain we can also measure 
the rate at which its frequency changes^ via 
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4f\ 



3/5 



G V967r8/3 dt J 



r 



-11/5 



(17) 



Thus, measurement of the signal strain and rate of change 
of frequency can together determine the system's chirp 
mass and its distance from Earth. 

For cosmological sources, however, the distance recov- 
ered by this method is not the comoving distance to the 
source x (equivalent to D for a spatially flat FRW uni- 
verse), but rather its luminosity distance Di, = (1 + z)X' 
This may be explained as follows: due to time dilation, 
the chirp mass of the system inferred from Eq. will 
be "redshifted" by a factor (1 + z), thus the signal will 
appear to have come from a source whose chirp mass is 
(1 + z)M.. Thus, if we reconstruct the masses of the bi- 
nary from the frequency evolution of the waveform, we 
will obtain redshifted masses a factor (1^ z) larger than 
the physical masses of the system at redshift z. Symbols 
such as m, M, M. will denote physical masses, whereas 
when discussing "redshifted" observed mass parameters 
we will use a superscript for instance mf = (1 + z)mi. 

This increase in apparent mass does not, however, 
mean that we will observe a greater signal amplitude: 
gravitational-wave amplitude, being dimensionless, can- 
not change due to redshift. Given this, and noting that 
A^/ is invariant under the effect of redshift, we find that 
a source with physical chirp mass M will appear to us to 
have a chirp mass (1 + and its apparent distance 

will be the luminosity distance Di^ = (1 z)x^ instead of 
the proper or comoving distance. 



^ In reality we don't directly measure the evolution of the fre- 
quency but use matched filtering to dig out the signal buried 
in noisy data. The end result, however, is the same. In fact, 
post-Newtonian approximation has allowed the computation of 
very accurate signal models which allows us to infer not only the 
chirp mass but also the mass ratio of the system. 



Let us now consider the distance reach of ET to an in- 
spiral signal from a compact binary of component masses 
mi and m2, at a luminosity distance Di^ and whose or- 
bit (assumed here to be quasi-circular) makes an angle i 
with the line of sight. There exist different measures of 
the distance reach of a detector: the horizon distance is 
commonly used in data analysis (see, for instance, [32^), 
while detector range and range functions were defined by 
Finn and Chernoff [33 and are routinely used as a mea- 
sure of detector performance. Our measures of distance 
reach are inspired by all of these concepts. 

The signal-to-noise ratio (SNR) pA for a given sig- 
nal (such as from an inspiralling binary), detected by 
matched filtering with an optimum filter, in a detector 
labelled A, is 



^0 



\HA{f)? 
Sn{f) 



d/, 



(18) 



where HA^f) is the Fourier transform of the response 
of detector A and Sn{f) is the one-sided noise power 
spectral density (PSD) of the detector, which we assume 
to be the same for all three detectors in the ET array. A 
good analytical fit to the ET-B noise PSD is given 
by SM) = lO-''hn{f)^Rz-\ where 

hnif) = 2.39 X 10-^'^ x-^^-^'^^ 0.349 x-^-^"^^ 

+ 1.76 +0.409 x^-^^ (19) 

and where x = //lOOHz. We may write the detector 
response in terms of two GW polarizations via HA{f) = 
F^H^ + F^Hy, , where 

V {GMn'^' (1 +COS^O ^-7/6^ (OQ) 




24 7r2/3c3/2l)L 
24 7r2/3c3/2l)L 



COS i / 



-7/6 



(21) 



The coherent SNR p for the ET network, for uncorrelated 
noises in the three detectors, is simply the quadrature 
sum of the individual SNRs: = J] Pa- We discuss 
possible correlated noise in Sections [ill B[ for the Gaus- 



sian noise budget, and V A concerning possible corre- 



lated noise transients. For the present idealized sensitive 
range calculation we consider uncorrelated noises. 

For low mass systems such as ENS, the SNR is domi- 
nated by the inspiral part of the signal; the coherent SNR 
can then be shown to reduce to 
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6 c%4/3 n2 
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-7/3 



Snif) 



d/, 



(22) 



where /i and /2 are lower and upper frequency cutoffs 
chosen so that the integral has negligible (say, < 1%) 
contribution outside this range and is a function of all 
the angles given by 



A >- 



(1 + cos" 0'(^+)'+cos"^(F^) 



?A\2 



(23) 
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Here F^^^ A = 1, 2, 3, are the antenna pattern func- 
tions of the detector given by Eqs. ([9|-(12). Substituting 
for the antenna pattern functions and summing over the 
three detectors gives 
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J'^(l9, (f, ip, l) = 7^ (1 + cos^ 0^ (1 + cos^ <9)%os^ 2^lJ 
(l + cos^ i) ^ cos^ Q sin^ 2?/^ 



cos^ ^ (l + cos^ d) sin^ 2i\) 



+ -cos ^ cos ^cos 



(24) 



The quantity T determines the SNR of a source of a 
given (observed) chirp mass at any given distance. Al- 
though the antenna power pattern is independent of 
-0), the quantity T is only independent of ^. For cer- 
tain source locations and orientations, the response is still 
independent of the polarization angle. For instance, ei- 
ther when the source is "overhead" with respect to ET's 
plane [i.e. 6> = 0,7r) or face-on [i.e. i = 0,7r), T is inde- 
pendent ofi/j. It depends weakly on ip for values of 6 and 
L significantly different from these extreme values. The 
maximum value J-max = 3/2 is obtained when = l = 0^ 
while the value of T'^ averaged over (6>, ip, i) is 



1 f^TT 
J^=— / / / sin i9 sin ^di9d^dV^ 

Stt Jo Jo Jo 



_9_ 
25' 



So the root-mean-square value of is J^rms = V = 
3/5. The horizon distance Di, of a detector is defined 
as the maximal distance at which an optimally oriented, 
overhead binary (^i.e. i = 6 = 0) can be detected above 
a threshold SNR of p = pT^ chosen large enough to keep 
the false alarm rate acceptably low; pr = S is considered 
reasonable for current detectors. Noting that T = 3/2 
when L = = 0^ for ET the horizon is given by 




2/3^3/2 



pT 



f- 



-7/3 



/i 



Shif) 



1/2 



(25) 



The horizon distance is not a very useful measure since 
essentially no signals can be detected beyond this dis- 
tance with an SNR larger than p^- A more meaningful 
measure of the reach is the distance Di, at which an "av- 
erage" source, meaning one for which T = Trms = 3/5, 
produces an SNR of pr- For such a source we obtain 



5 



(26) 



For a binary consisting of two components of (physi- 
cal) mass 1.4 Mq and for a threshold px = 8, we find 
i^L — 13 Gpc or z = 1.8, and Di^ 37Gpc or 2: = 4; 
these distances can be larger for more massive binaries, 
and our simulated binary component masses extend up 
to 3Mq. In our simulations, we inject signals of differ- 
ent orientations and polarization angles distributed uni- 
formly over comoving volume up to a redshift of 2; = 6. 



E. Efficiency vs. distance 

The efficiency of a detector at a given distance, and 
for binary sources with given physical component masses, 
is the fraction of such sources for which ET achieves an 
expected SNR p > pr- ET will not be sensitive to every 
BNS merger at any given distance, but only to those that 
are preferentially located in certain sky directions and 
are suitably oriented [33]. The fraction €{Di,) of sources 
detected by ET at a given luminosity distance is given 

by 

e{Di^) = — / / n(p/pT - 1) sin l9 sin ^d(9d^dV^, 
Stt Jo Jo Jo 

(27) 

where 11 is the unit step function Il{x) = if x < and 
n(x) = 1 if X > 0. Note that p is a function of all angles, 
luminosity distance, redshift, etc. In Figure [7| top right, 
we plot ET's efficiency as a function of redshift for binary 
neutron stars: the blue solid curve shows the efficiency 
for physical masses mi = m2 = 1.4 M©, choosing a SNR 
threshold pr = 8 and a lower frequency cutoff /i = 1 Hz. 
As shown in this figure, ET should have 50% efficiency 
at a redshift of z ~ 1.3, while its efficiency at z = 1.8 
(distance at which the angle- averaged SNR is 8) is 30%. 



III. SIMULATION OF ET MOCK DATA 

In this section we will discuss how ET mock data was 
generated. We will describe the cosmological model used 
and the rate of coalescence of binary neutron stars as a 
function of redshift. We will also discuss how the back- 
ground noise was generated and the waveform model used 
in the simulation. 



A. Simulation of the GW Signal 

We use Monte Carlo techniques to generate simulated 
extra-galactic populations of binary neutron stars and 
produce time series of the gravitational wave signal in 
the frequency band of ET. We first describe how the dis- 
tribution of injected BNS sources over redshift and mass 
was obtained, and then explain the simulation pipeline 
summarized in Fig. [3j 

We first consider the rate of BNS coalescences in the 
Universe. We neglect the possible production of com- 
pact binaries through interactions in dense star systems, 
and we assume that the final merger of a compact bi- 
nary occurs after two massive stars in a binary system 
have collapsed to form neutron stars and have inspi- 
r ailed through the emission of gravitational waves. The 
merger rate tracks the star formation rate (SFR), albeit 
with some delay td from formation of the binary to final 
merger. We use the SFR of [36 , which is derived from 
new measurements of the galaxy luminosity function in 
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FIG. 2: Left: Average waiting time as a function of maximal redshift. Right: Probability distribution of the redshift, assuming 
the star formation rate of 36 , a distribution of the delay of the form P{td) oc 1/td with minimal delay of 20Myr and a local 
coalescence rate density of 1 Mpc""^ Myr~^. 



the UV (SDSS, GALEX, C0MB017) and FIR wave- 
lengths (Spitzer Space Telescope), and is normalized by 
the SuperKamiokande limit on the electron-antineutrino 
flux from past core-collapse supernovas. This model is 
expected to be quite accurate up to 2; ~ 2, with very 
tight constraints at redshifts z < 1 (to within 30 — 50%). 

Following [17 , we write the coalescence rate density 
pc{z) (in Mpc~^ yr~-^) as: 



Pc{z) (X 



1: 



l-^Zf{z,td) 



P(td)dtd with pc(0) = po , 
(28) 

where is the SFR of [36^ (in Mpc"^ yr-^), z the 
redshift when the binary system merges, Zf the redshift 
when the binary system is formed, P{t(i) the probability 
distribution of the delay connecting z and z/, and po 
the rate density in our local universe. The normalization 
condition reproduces the local rate density for z = and 
the factor (1 + ^/)~^ converts the rate density in the 
source frame into a rate density in the observer frame. 

The redshifts Zf and z are related by the delay time 
id which is the sum of the time from the initial binary 
formation to its evolution into a compact binary, plus 
the merging time after which emission of gravitational 
waves occurs. The delay is also the difference in lookback 
times between Zf and z: 



td 



Ho 



dz' 



{l^z')E{n,z') 



where 



(29) 



(30) 



In these simulations, we have assumed a flat Universe 
with Clm = 0.3 and = 0.7 and Hubble parameter 
Ho = 70km s-^Mpc-^ 



We assume a distribution of the form P{td) cx 1/td^ as 
suggested by population synthesis [37 , with a minimal 
delay t^^^ = 20 Myr, corresponding roughly to the time 
it takes for massive binaries to evolve into two neutron 
stars [3F . This broad model accounts for the wide range 
of merger times observed in binary pulsars; it is also con- 
sistent with short gamma ray burst observations in both 
late and early type galaxies [39 . 

The coalescence rate per redshift bin is then is given 

by 



dR dV 

where dV/dz is the comoving volume element: 

dV , . , c r{zf 
dz^ ^ Ho E{n, z) ' 



where 



r{z) 



-I 

Jo Jo 



dz' 



Ho Jo E{n,z' 



(31) 



(32) 



(33) 



is the proper distance. 

The average waiting time At between signals is cal- 
culated by taking the inverse of the coalescence rate, 
Eq. (31), integrated over all redshifts: 



A 



Jo 



dR 

dz 



1 -1 



{z)dz 



(34) 



Fig. [2j left panel, shows At as a function of the maximal 
redshift z^ax out to which events are generated, given 
a local coalescence rate of po = 1 Myr~^ Mpc~^ which 
corresponds to the galactic rate estimated in [35], and 
which we adopt here. 
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FIG. 3: Flow diagram of the Monte Carlo simulation code 



We assume that signals arrive at the detector as a Pois- 
son process and draw the time intervals At = —t^ be- 
tween successive coalescences at times and t^, from 
an exponential distribution P{At) = exp(— At/A). Coa- 
lescence times t^ are generated between the start time of 
the observation t^ and the end time t/, to which we add 
the maximal duration Tmax that a source can have in our 
frequency range (a 1.2 + 1.2 system at z = 0). 

Then, we proceed as follows for each source: 

1. The physical masses of the two neutron stars are 
drawn from a Gaussian distribution with mean 
1.4 Mq and variance 0.5 M©, and are restricted to 
the interval [1.2,3] Mq. 

2. The redshift is drawn from a probability distribu- 
tion p{z) obtained by normalizing the coalescence 
rate dR/dz in the interval 0-Zmax* 

p,{z) = X—{z). (35) 

Next we calculate the duration of the waveform in 
our frequency range: 

where /i is the low-frequency cutoff of the detec- 
tor; due to computational limitations in this initial 
study we take /i = 10 Hz for the simulated signals. 

3. For each source visible in our observation time- 
window [ti,t/], the source's location in the sky, its 



orientation, the polarization angle and the phase 
at the coalescence are drawn from uniform distri- 
butions. 

4. The gravitational wave signal h{t) = F_^{t)h-^{t) + 
Fxhx{t) of the source is calculated for each detec- 
tor El, E2 and E3 and for each observation time 
t^ until the frequency reaches /i, and is added to 
the time series. In these simulations, we have used 
so-called TaylorT4 waveforms [49 , up to 3.5 post- 
Newtonian order in phase 0(t) and only the most 
dominant lowest post-Newtonian order term in am- 
plitude: 

/i+(t) = A(t)(l + cos^Ocos[(^(t)] (37) 
hy,{t) = 2A(t)cos^cos[0(t)] (38) 

where ^, as before, is the inclination angle of the 
binary with respect to the line of sight. 

The signal amplitude is then 

where the luminosity distance Di^ is in Mpc, 
in M0, and where /(t) in Hz is the instantaneous 
gravitational- wave frequency (twice the binary's or- 
bital frequency) which increases monotonically as 
the system shrinks and gets closer to merger. For a 
description of the TaylorT4 approximant and how 
it relates to other waveform approximant s, see |50] 
and references therein. 
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Theoretically, neutron stars could have maximum di- 
mensionless spins x = cJl(GM^\ where J is the star's 
angular momentum and M its mass, as large as 0.5 to 
0.7, depending on the equation of state [51 . These are 
moderately large spins and including spin effects in our 
waveform model would be essential for unbiased and ac- 
curate parameter estimation in real searches. However, 
in this exploratory work we neglect spins, as our main 
aim is to investigate the difficulty of discriminating over- 
lapping signals. From an astrophysical point of view, 
neutron stars in coalescing binaries, such as the Hulse- 
Taylor binary, have rather small spins of ~ 6 x 10~^, 
which will not significantly affect the phase evolution of 
the signal. 

An example time series of the gravitational wave signal 
including sources up to a redshift z ~ 6 (before adding 
simulated detector noise) is shown in the top plot of 
Fig. |4j left panel. Although the sources overlap strongly 
in time, they are well separated in frequency, or become 
so when close to coalescence: an exception could be if two 
BNS signals with similar redshifted chirp masses were ap- 
proaching coalescence within < 1 s of each other. Due to 
the form of the detector PSD, the main contribution to 
the matched filter power of any binary coalescence signal 
occurs when the chirping frequency is close to 100 Hz; 
the "chirp" is sufficiently rapid at (and after) this point 
that different sources can be clearly resolved. This is 
illustrated by the bottom plot of Fig. [4j left panel, show- 
ing the optimal time domain filter, i.e. the inverse Fourier 
transform of the frequency-domain signal weighted by the 
noise power spectral density. The detector PSD acts like 
a bandpass filter, weighting down the lower frequencies 
where the signal spends most of its time. The effective 
lenghts of signals, as "seen" by ET, are, therefore, a lot 
shorter than they actually are. Consequently, overlap- 
ping signals seen in the upper panel do not lead to a loss 
in detection efficiency as we shall show in Sec IVB. 



B. Simulation of the noise 

In order to produce the data set, it is necessary to 
use a model of the expected noise for the ET detectors. 
To this end, we assume that the noise will be stationary 
and Gaussian. Moreover, for the time being, we assume 
that the noise realizations in the different detectors are 
uncorrelated. 

In reality the noise in collocated detectors will, most 
likely, have some correlated components, as has been seen 
in the two LIGO Hanford detectors. Different ET in- 
terferometers will be separately isolated in vacuum sys- 
tems, thus we do not expect correlations of thermal or 
quantum noises, which form the main contribution above 
about 10 Hz. Common noise arises most significantly as 
a result of having end stations with similar physical en- 
vironments for two detectors: in particular, concerning 
ET's low-frequency sensitivity, seismic and Newtonian 
noise displacements. For ET, however, it is envisaged to 



stagger the end stations of the three interferometers by 
1 km. Whether this will reduce the common displacement 
noises to the extent that the detectors can be considered 
independent is a question under current investigation. 

The noise for each detector was generated using the 
following procedure: Firstly, we generate a Gaussian time 
series with a mean of zero, and unit variance. These 
time series are Fourier transformed and coloured by the 
relevant ET sensitivity curve in the frequency domain. 
To get the final time-domain representation of the noise, 
we apply an inverse Fourier transform. 

The noise curve used is based on the analytic fit of 
Eq. (irgl) to the ET-B PSD discussed in Section II. To al- 



leviate the effects of possible discontinuities across frame 
files, the PSD is gradually tapered to zero below the low 
frequency limit fi = 10 Hz, and above a frequency of 
/2 = /Nyquist/2. Fig.jlj right panel, shows the noise curve 
used to colour the data, with the tapering applied, for a 
sample rate of 8192 Hz. The taper essentially acts like a 
bandpass filter and removes power outside the band of in- 
terest. The absence of very high and very low frequencies 
essentially assures continuity across the data segments 

EH. 



IV. FIRST ANALYSIS 
A. Null stream 

A null stream is a combination of the detector output 
streams such that the gravitational wave signal is iden- 
tically zero and only noise remains. The existence of an 
ET null stream was noted already in [53 and is a major 
motivation for the triangular triple Michelson topology. 
Given an incident GW tensor hij^ the three interferom- 
eter responses were derived in Eq. ([s]), from which, as 
already remarked in Section II C , we find that the sum of 
the three detector responses to any GW signal vanishes 
identically. We may define the null stream as the sum 
of the strain time series x{t) for the three ET detectors. 
For each single detector A we have 



x^{t) = n^{t)^df^h'^{t), 
where n^{t) is the noise realization, thus 

3 



(39) 



Xnnllit) = ^X-^it) 
A=l 
3 3 

A=l 1=1 

= E^^'W (40) 



A=l 



is free of GW signals, and will also not contain any com- 
mon (correlated) noise for which the sum over the three 
detectors happens to vanish. 
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Frequency / Hz 



FIG. 4: Left: Simulated time series of the gravitational strain at detector El, for Zmax = 6 and /i = 10 Hz (top) and the 
same time series after the Fourier transform has been divided by the noise power spectral density of ET. Right: The tapered 
projected ET noise spectrum used to color the noise. Example audio files of the simulated GW signal alone or in the presence 
of noise can be found at the ET MDC website http://www.oca.eu/regimbau/ET-MDC_web/ET-MDC.html 



If the noise properties are homogeneous among the de- 
tectors, 

S'nif) ^ Slif) c Slif) , (41) 

and if correlations between detectors can be neglected, 
we can use the null stream to estimate the average PSD 
in each of the three detectors. In this case, 




^ 3i5(/-/')5;f(/), (42) 

where Xnuii(/) is the Fourier transform of Xnu\\{t) and, in 
the last line, S^{f) is the noise PSD in any of the three 
interferometers in the absence of a GW signal. Defining 

(X„,n(/)X:,ii(/')) = \5{f - /')5„,„uu(/), (43) 
we find 

S^{f) ^ ^5„,„uu(/) (44) 

as an estimate for the individual single-interferometer 
PSDs with the signals removed. 

The null stream PSD, which we plot on the left panel 
of Fig. [Sj then has the advantage of giving a better rep- 
resentation of the noise content of the three detectors. 
The typical sensitivity improvement is nonetheless fairly 
small, about 1% in the 10 - 100 Hz band. As a proof 
of principle of the effectiveness of the use of null stream 



PSD instead of the single detector one, we computed the 
median over the whole dataset of the difference between 
(one-third of) the null stream PSD and the individual 
detector PSD's S^{f). These residuals should be consis- 
tent with the median PSD of the injected signals in each 
detector: 

S^U) - \Sn,nuXX ^ Hif) (45) 

where H{f) is the power spectral density of GW sig- 
nals. The result of this operation is shown in Fig.[5j right 
panel. The residual spectrum between 10 and 400 Hz in 
each detector is consistent with the theoretical expecta- 
tion Snif) ~ 

B. Compact Binary Coalescence analysis 

We analysed the triple coincident simulated data us- 
ing a modified version of the LIGO- Virgo Ihope pipeline 
[El Ell [40l m] which is used to search for signals from 
compact binary coalescences (CBC). This pipeline is a 
coincident analysis: data streams from different detec- 
tors are separately filtered against template waveforms 
and the resulting maxima of SNR are checked for con- 
sistency between detectors. The main motivation of this 
procedure is to reduce computational cost when analyz- 
ing data from spatially separated detectors with a-priori 
unknown duty cycles. 

Coherent analysis, where data streams are combined 
before finding maxima of SNR, should in principle be 
more sensitive at fixed false alarm rate if many detectors 
are involved [42H46] . For ET, the detector outputs could 
be combined into a null stream and synthetic + and x 
detectors, and for the 2 non-null streams, the coherent 
detection statistic is then identical to the coincident one 
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FIG. 5: Left — Sample PSD calculated from the null stream, compared to the theoretical ETB fit|19| The null PSD is estimated 
within data segments of length 2048 s by averaging non-overlapping samples each of 1 s length, and is then averaged over all 
2048 s long segments in the dataset. The figure shows \/5'n,nuii(/)/3. Right — Difference between (one-third of) the null stream 
PSD and the Ej PSDs obtained by averaging over the whole dataset, as defined in Eq. (45). The residuals are consistent with 
the f~^^^ spectrum expected from binary inspiral signals. To aid visibility, the quantities plotted have been scaled by lO'^^ and 
the constant 0.01 has been added. 



[45^. Hence unless other sites contribute there is no gain 
expected specifically from using a coherent analysis. In 
our case we might expect a small gain in sensitivity by 
using synthetic + and x data, since it eliminates a frac- 
tion of the noise from each detector (the contribution 
to the null stream) while keeping all the signal power. ^ 
However to establish an initial benchmark we have kept 
the existing framework where each physical detector is 
filtered separately. 

The stages of the coincident pipeline are as follows: 

• Estimation of the PSD by median over several over- 
lapping time chunks within a 2048 s segment. We 
use the single-detector outputs rather than the null 
stream to estimate the noise: in principle loud sig- 
nals could bias this estimation, however as shown in 
Fig. |5] any such bias is on average extremely small; 
we also compared the estimated sensitivity over dif- 
ferent segments of single-detector data and found 
negligibly small differences. 

• Generation of a template bank covering the chosen 
parameter space of binary masses 

• Matched filtering of each template against the data 
stream of each detector to generate an SNR time 
series p{t) 

• Trigger generation: for each template, maxima of 
SNR over a sliding time window of length 15 s were 



^ Alternatively, one can view this recombination as creating two 
synthetic detectors with 90° opening angles and slightly better 
sensitivities than each of the original three detectors. 



found, and a "trigger" was generated if any such 
maxima exceeded an SNR of 5.5 

• Clustering to reduce trigger numbers: if there are 
multiple triggers within a small region of parame- 
ter space (binary masses plus time [54 ) the trigger 
with largest SNR is selected and others in the re- 
gion are discarded 

• Coincidence between detectors: only pairs or triples 
of triggers with consistent coalescence times and 
masses [55, survive and are designated as events. 

• Ranking of events by combined SNR^, (sum of 

over coincident triggers). 

There are several differences compared to standard 
LIGO- Virgo searches. The main ones concern the fre- 
quency range of data searched, the parameter space of 
the search and the method for determining the signifi- 
cance of candidate events. 

The length of an inspiral template increases rapidly 
with the lowest frequency that is matched filtered in the 
analysis (Eq. (|36|). For technical reasons related to mem- 
ory load and PSD estimation, the standard matched fil- 
ter code used for LSC- Virgo analyses [40] cannot filter 
templates longer than a few minutes: hence we chose to 
impose a lower frequency cutoff of 25 Hz. This limita- 
tion should be addressed in future analyses, and may be 
relevant to analysis of Advanced LIGO/Virgo data. 

The template bank was chosen to cover the possible 
range of redshifted {i.e. observed) mass pairs correspond- 
ing to the BNS injections up to redshift 4. The mini- 
mum component mass was taken as 1.2 M©; with a max- 
imum injected component mass of SM©, the observed 
total mass at 2: = 4 is then 15 M©, which we took as 



12 




10^ 10^ 

Combined snr 



FIG. 6: Cumulative histogram of CBC events as a function 
of combined SNR pc, divided into double (two-detector) and 
triple (three-detector) coincidences. 



our maximum component mass, with a maximum total 
mass of 30 Mq. The maximum injected mass ratio is 
3/1.2 = 2.5 corresponding to a "symmetric mass ratio" 
T] — mim2/{mi + m2)^ — 0.204, thus templates with 
T] < 0.2 were removed, considerably reducing the size of 
the bank. 

Since the simulated noise was Gaussian, signal-based 
vetoes and data quality vetoes were not necessary to sup- 
press detector artefacts, and events were ranked simply 
by the quadrature sum of SNR over coincident triggers. 
The noise background in our mock data is expected to be 
a function of combined SNR alone, thus we set a thresh- 
old in the value of pc above which we consider an event 
likely to be a true GW signal. 

Note that the time shift method used to estimate back- 
ground event rates in LIGO-Virgo searches fails here. In 
order for such methods to be valid, the number of de- 
tectable GW events over the search time should be small 
(of order 1): otherwise, loud triggers due to true GW 
signals may significantly distort the background distribu- 
tion, by forming random time-shifted coincidences with 
noise triggers. In the present case we see tens of thou- 
sands of detectable signals, thus the distribution of loud 
time-shifted coincidences is totally dominated by such 
'signal-plus- noise' events. 



1. Events found by CBC analysis 

The CBC analysis outputs a list of loudest events with 
the coalescence time, combined SNR, and the compo- 
nent masses of the best-fitting template for each event. 
The distribution over pc is plotted in Figure [6] for both 
double- and triple-coincident events. Given the single- 
detector SNR threshold pt = 5.5, the quietest possible 
double coincidence has pc = V^pt — 7.78 and the qui- 
etest triple has pc = V^pt — 9.53. The expected cumu- 



lative distribution of events from an astrophysical popu- 
lation is approximately proportional to the inverse cube 
of combined SNR (thus to the cube of the luminosity dis- 
tance, or to the volume of space seen by the search). De- 
viations from this inverse-cube behaviour will arise due 
to evolution of the source population over redshift, also 
because the physical volume of space is no longer exactly 
proportional to distance^ at large and also since the 
observed masses of a coalescing binary are larger than the 
physical masses by a factor (1 + ^), changing the expected 
SNRs. 

Over most of the range of pc the distribution of triple 
coincidences is close to ~ p^^ as expected; with decreas- 
ing combined SNR values, an increasing fraction of sig- 
nals are seen as double coincidences. We see no signifi- 
cant background distribution of triples, which would be 
expected to rise exponentially at small combined SNR. 
Thus in principle the efficiency of the search could be 
improved by lowering the SNR threshold. 

The distribution of double coincidences shows two 
components: an approximate power-law at higher pc and 
a more rapidly rising component below about pc = 9. We 
interpret these as a cosmological population of sources, 
modulated by the variation in the proportion found as 
doubles vs. triples; and a Gaussian noise background, re- 
spectively. Thus we expect that above a combined SNR 
Pc ^ 9 the great majority of events will be caused by 
binary coalescence signals rather than random noise. 



2. Efficiency and accuracy 

We evaluate the search efficiency as a function of 
redshift by testing time coincidence between simulated 
signals and found events (using a "coalescence time" 
at which the chirping signal reaches a well-defined fre- 
quency) and choose a time window of ±30 ms. For a 
given event or injection there are the following cases: 

• False event: an event which does not fall within 
30 ms of an injection 

• True event: an event falling within 30 ms of one or 
more injections 

• Missed injection: a simulated signal which does not 
fall within 30 ms of an event 

• Found injection: an simulated signal within 30 ms 
of a found event. ^ 

However, if we have very frequent candidate events 
or injections, we may encounter significant numbers of 



^ Note that events are clustered over time windows of a few sec- 
onds, thus more than one event cannot be found within a 30 ms 
window. 
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wrongly found injections^ meaning chance time coinci- 
dences between injections and noise events where the 
expected SNR of the injected signal is below the anal- 
ysis threshold. For these we do not expect the estimated 
mass parameters and effective luminosity distance from 
the analysis pipeline to correspond to those of the simu- 
lated source; the fractional error in these parameters will 
be order (1). Wrongly found injections would lead us to 
overestimate the search efficiency and would degrade the 
accuracy of recovered source parameters. To minimize 
such effects whenever two or more simulated signals fall 
within ±30 ms of an event, we consider only the injec- 
tion with the lowest redshift to be found. In practice this 
ambiguity is found to affect only a small fraction (sub- 
percent) of signals. In order to minimize possible bias 
in assessing the accuracy of recovered source parameters, 
we do not impose any further requirement (for instance, 
on the chirp mass) in order for an injection to be counted 
as found. 

As mentioned above, we require a threshold on pc to 
limit the number of false events caused by noise. Here we 
choose to impose pc > 8.8, finding 36774 events above 
this threshold in the 2419200 s of data analyzed. By com- 
paring these with the catalogue containing 177350 sim- 
ulated coalescence signals over the analysis time we find 
850 false events, giving a directly estimated false alarm 
probability (FAP) of 2.3%. The efficiency of finding injec- 
tions as a function of redshift is summarized in Figure [7| 
top two panels. 

In the top right plot we compare the efficiency of the 
current analysis with the theoretical ideal efficiency de- 
fined in Eq. (27), for two different values of the threshold 
SNR pT and the low frequency cutoff /i. The ihope anal- 
ysis does somewhat worse than the corresponding the- 
oretical curve, which can in part be attributed to the 
single-detector SNR threshold pt = 5.5; the theoretical 
calculation does not impose a lower limit on the ampli- 
tude of signals in single detectors contributing coherently 
to the significance of an event. 

We evaluate the accuracy of the recovered (observed) 
chirp mass A4^ via the discrepancy (Al^ec ~ -^fnj)/-^fnj 
plotted in Figure [7| lower left plot.^ The vast majority 
of events found have a well recovered chirp mass with an 
accuracy better than 0.5%, even for the small number 
of sources recovered at redshift z > 4: the number of 
wrongly found injections with violently inaccurate A4 is 
order(lO). The chirp mass is the chief parameter govern- 
ing the frequency evolution of a compact binary system 
due to its emission of energy in GW, thus we can deduce 
the luminosity in GW of such systems with good accu- 
racy out to extremely large distances. But note again 
that we cannot determine the system's physical masses 
without an independent determination of its redshift. 



The distribution of errors in observed total mass 
is significantly broader: see Figure [7| lower right plot, 
where there is a slight overall bias towards overestimating 
and a small population of injections for which the 
total mass is overestimated by 5-10%. We find that this 
population consists of nearly equal-mass binaries which 
are found with somewhat more asymmetrical templates. 
Since the inspiral signal is significantly less sensitive to 
changes in mass ratio or total mass keeping a fixed 
than vice versa, we expect a larger spread in recovered 
M values than in J\4. We might also expect a bias in the 
recovered mass parameters due to taking the maximum 
SNR value over some region of the template bank, since 
the density of templates is greater at smaller M. and at 
smaller r]. The recovered mass ratio is also expected to 
be more sensitive to systematic differences in injected vs. 
template waveforms. 



C. Stochastic 

The superposition of the GW signals from our popula- 
tion of BNS creates a background which is expected to be 
isotropic (the source position in the sky and polarization 
were selected from a uniform distribution) and station- 
ary (the length of the data is much greater than the time 
interval between successive events, and the duration of 
the waveform). Its properties in the frequency domain 
can be characterized by the dimensionless energy density 
parameter ^47j: 



^gw if) 



1 dpg^ 

Per dlnf 



(46) 



where is the gravitational energy density and per = 
Sc'^Hq /{SttG) the critical energy density needed to make 
the Universe flat today. 

This quantity is related to the one-sided (/ > 0) power 
spectral density in gravitational waves, at the detector 
output (Sh) ' 



^gw (/) 



IOtt^ 



(47) 



^ The fractional error in observed chirp mass JV[^ is mathemati- 
cahy identical to the fractional error in physical chirp mass JV[. 



where 7 is the opening angle of the interferometer arms. 

Note that the background from BNS is not Gaussian 
at frequencies > 10 Hz, in the sense that the number of 
sources overlapping at a given time is too small for the 
central limit theorem to apply and for the distribution 
of the sum of the amplitudes to have a Gaussian distri- 
bution. Thus, knowledge of Vtgw{f) does not completely 
specify the statistical properties of the background, as 
there may be non- vanishing moments other than the vari- 
ance. In particular, the amplitude distribution of the GW 
signal may exhibit large tails compared to the Gaussian 
case. 

For the population of neutron stars distributed accord- 
ing to the probability distributions discussed in Sec. III. A 
for the mass, redshift, position in the sky, polarization 
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FIG. 7: Top left — Distributions of all BNS injections, and those found by the CBC pipeline, vs. redshift. Here events with 
pc > 8.8 were considered as candidate signals. Top right — Efficiency of the CBC search vs. redshift. We show the theoretical 
(ideal) efficiency as defined in Eq. (27) for a threshold SNR of pT = 8 and a low frequency cutoff /i = 1 Hz, and also for 
Pt — 8.8, /i = 25 Hz for comparison with the signals found by the ihope pipeline. Bottom left — Histogram of fractional errors 
in chirp mass. Bottom right — Histogram of fractional errors in total mass. 




FIG. 8: Left — Overlap reduction function for two V-shaped ET detectors separated by 120 degrees. Right — Energy density 
parameter of the background produced by the coalescence of binary neutron stars, as a function of observed frequency. 
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and inclination, the predicted flg^j is shown in Fig. |8j 
right panel, and can be derived from the expression ^56.- 
[58]: 



Pc 



-fF{f) 



(48) 



where the integrated flux at the observed frequency / is 
given by the sum of all the individual contributions at all 
redshifts: 



F{f) = 



r-^max 



dz 



AttDUz) df 



'^^^-if,M{l + z)Az) (49) 



df 



the spectral 



where Dj^ is the luminosity distance, 

energy density averaged over orientation and M. is the 
average physical chirp mass of the population. 

In the quadrupolar approximation, and assuming a cir- 
cular orbit, 



df 



for / < fL 
(50) 

where ff^^ = {1 -\- z) ^ fiso is the observed (redshifted) 
frequency at the last stable orbit. The predicted energy 
density parameter increases as /^/^ before it reaches a 
maximum Vtgyj ~ 4 x 10~^ at around 600 Hz, with a 
reference value at 100 Hz of l^ref = 1-9 x 10~^. 

The strategy to search for a Gaussian (or continuous) 
background, which could be confused with the intrinsic 
noise of a single interferometer, is to cross-correlate mea- 
surements of multiple detectors. When the background 
is assumed to be isotropic, unpolarized and stationary, 
the cross-correlation product is given by [47] 



Y 



m)Q{f)^2{f)df 



(51) 



and the expected variance, which is dominated by the 
noise, by 



4 



siif)si{f)m)\'df, 



where 



Q(/)« 



7l2(/)»gw(/) 



(52) 



(53) 



is a filter that maximizes the signal-to- noise ratio {S/N). 
In the above equation, and are the one-sided power 
spectral noise densities of the two detectors and 712 is the 
normalized overlap reduction function, characterizing the 
loss of sensitivity due to the separation and the relative 
orientation of the detectors: see Fig. [S) left panel. For 
two V-shaped detectors (7 = 7r/3) separated by /3 = 
27r/3 degrees, 712(0) = sin^(7) cos(2/3) = -3/8. The 
normalization ensures that 712 = 1 for co-located and 
co-aligned L-shaped detectors. 

We analyzed the data with the cross-correlation code 
developed by the LIGO stochastic group. The data were 



split into = 40320 segments of length Tseg = 60 s, and 
for each segment the cross-correlation product and the 
theoretical variance were calculated using a template Q ^ 
/^/^ in the range 10 — 500 Hz. The frequency resolution 
of our analysis was 0.25 Hz. The final point estimate at 
100 Hz is given by [Ml EQ] 



opt 



Tseg Xli ^\ 



(54) 



where Yi and ay i are the cross-correlations and variances 
calculated for each segment via Eq. (51), (52) respec- 
tively, and Yopt is the weighted sum 



Yr 



opt 



(55) 



The standard error on this estimate is given by 

-1/2 



CTQ 



(56) 



We found a point estimate at 100 Hz of 2.00 x 10 ^ for 
the pair E1-E2, 1.90 x 10"^ for E2-E3 and 2.03 x 10"^ 
for E2-E3 (an average of ~ 1.97 x 10~^), with error 
(j^ = 4.96 X 10~^^ for the three pairs, at 100 Hz, which 
corresponds to the analytical expectation of ~ 1.9 x 10~^ 
with a precision better than 5%. Even if the back- 
ground from compact binaries is not a Gaussian con- 
tinuous stochastic background, but rather a popcorn-like 
background in the considered frequency range / > 10 Hz 
pT] [58] [6T|, our analysis has shown that non-Gaussian 
regimes can still be recovered by the standard cross- 
correlation statistics, confirming the results of [82^ . 



V. FUTURE DEVELOPMENT 

This first set of Mock Data included only a single type 
of signal, although the BNS systems we simulated are ex- 
pected to be the most numerous and can thus yield much 
interesting information for astrophysics and cosmology. 
Moreover due to computational limitations we did not ex- 
tend the simulations below a frequency of 10 Hz, though 
doing so might significantly improve the ability to ex- 
tract signal parameters. Future Mock Data sets should 
address these and other points by: 

1. Including more types of GW sources; 

2. Using more complete or realistic waveforms; 

3. Using a more sophisticated noise model. 

Under the first heading, binary coalescence signals in- 
cluding stellar mass or intermediate mass black holes 
(IMBH) [T9H22] are of particular interest. A small num- 
ber of burst sources such as Type II supernovae are ex- 
pected in the ET dataset and numerical simulations (for 
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instance [G^) could be used to produce injection wave- 
forms. It is also possible that primordial stochastic GW 
backgrounds exist in the ET sensitive band [18]; detect- 
ing these and determining their parameters would be an 
interesting challenge given the significant contribution of 
astrophysical sources. For BNS coalescences, our injected 
waveforms could be improved by extending the lower fre- 
quency cutoff, but also by modelling the merger phase 
(which depends strongly on the equation of state of NS 
matter, as well as the component masses). We expect 
that significant science can be extracted from BNS merg- 
ers, and from the tidal deformations occurring in the 
pre-merger phase, that are neglected in the PN wave- 
form model we currently use [TTJ [12). Finally, we can 
simulate more realistic noise by adding occasional ran- 
dom glitches to the data, which may be supposed to be 
of instrumental or environmental origin. To create an 
interesting challenge, single-detector glitches should be 
added with a higher rate than detectable signals. 



A. Challenges for CBC analysis 

The initial search for coalescing binaries presented 
here, although moderately efficient below ^ = 1, has some 
significant drawbacks. Here we discuss how it could be 
improved, and point to some current developments in 
CBC data analysis. 

In order to realize the full potential of ET's low fre- 
quency sensitivity down to 10 Hz and below, waveforms 
lasting on the order of an hour or more should be matched 
filtered. For this a simple template bank as used in 
current searches would be prohibitively computationally 
costly [63l [64] containing hundreds of thousands of tem- 
plates or more. Currently, multi-band filter methods are 
being developed [65 which split up the waveform into 
time slices with different frequency content: thus the ear- 
lier part of the waveform can be downsampled, reducing 
computational load. The resulting template banks for 
each time slice are still large, and can be significantly re- 
duced by singular value decomposition [67l [65 allowing 
a computationally realistic search to be performed, while 
retaining the ability to reconstruct the SNR for each of 
the original templates. 

We saw that the sensitivity of the coincident analy- 
sis was limited by the SNR threshold applied to single- 
detector triggers. Due to this threshold, signals from 
distant sources were often seen as double coincidences, 
which compete with a much larger noise background rate 
than triples. However, on lowering the threshold, the 
computational load would increase, as would the number 
of background triple coincidences. As discussed earlier, 
recombining the three detector outputs into synthetic + 
and X data streams should improve the overall separation 
of signal vs. noise. 

If detectors at other locations are operating at the same 
time as ET, a coherent search should be performed to 
maximize sensitivity; such searches are currently under 



development, although facing an obstacle in their com- 
putational costs. 

We did not implement the null stream estimate of 



Eq. ([44|) for the single-detector PSD within the CBC 
analysis. The difference with respect to the individual 
detector PSDs, including the contribution of signals, was 
less than 1%, which we do not expect to cause a measur- 
able change in efficiency; however, if the contributions of 
GW signals were significantly higher, it might be benefi- 
cial to use the null stream PSD for template placement 
and matched filtering. 

To obtain an unbiased estimate of the source parame- 
ters for each signal, a Bayesian analysis of the strain data 
should be performed [69 - 72 . The chief conceptual chal- 
lenge is the likely presence of many signals within any 
stretch of data longer than about a minute [17 . Naively, 
in order to model them one would have to multiply the 
dimensionality of the source parameter space by the num- 
ber of signals, however more efficient methods should ex- 
ist; the problem is analogous to one faced in identifying 
multiple galactic binary sources in mock LISA data [73] 
and [64 suggested that similar algorithms, for instance 
Markov Chain Monte Carlo based codes, could be used 
for ET. There will also be computational challenges in 
performing the analysis on hour-long stretches of data. 

A conceptually difficult problem, not present in the 
current set of ET mock data, is to identify signals among 
an unmodelled background of non- Gaussian noise tran- 
sients when the rates of signals and transients may both 
be large. As seen in the initial CBC analysis, the method 
of background estimation via time shifts between detec- 
tors is invalid if signals are frequent. The broad sensitiv- 
ity spectrum and increased length of binary coalescence 
signals visible in ET gives us hope that signal-based ve- 
toes based on the distribution of matched filter power 
over frequency ^1 or over other parameters will be ef- 
fective in separating signals from noise transients. The 
null stream may also be useful to identify times when 
non- Gaussian noise is likely to produce loud false events, 
and to down-rank or veto such events. 

If there are common non- Gaussian noise transients in 
more than one ET detector, which may be caused by en- 
vironmental disturbances, distinguishing these from GW 
signals may be more difficult, though it is still unlikely for 
such disturbances to cancel completely in the null stream. 
In any case, the use of signal-based vetoes should greatly 
assist in mitigating the effect of common noise for long- 
lived signals such as those from binary neutron stars. 

Current methods for optimizing such vetoes involve 
adding simulated signals to strain data which are as- 
sumed not to contain real signals; these must be revisited 
for ET, for example by using the null stream for simu- 
lations. Single-detector triggers which fail a coincidence 
test could also be used to train glitch rejection methods. 

One way to interpret such methods is to define a de- 
tection statistic for candidate events, with larger values 
indicating greater likelihood of signal vs. noise, for in- 
stance the "re- weighted SNR" of ^ [76]. Under the 
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weak assumption that some number of loud signals can 
be detected with high confidence, we should see an as- 
trophysical event distribution over the statistic value of 
predictable form, superimposed on a population of noise 
transients. If the noise distribution is sufficiently dif- 
ferent from that of signals, ideally decreasing rapidly at 
high statistic values [76 , it may be possible to separate 
the two populations simply by fitting the astrophysical 
component. However, such a procedure would depend on 
the noise event population being sufficiently well under- 
stood. 



B. Challenges for stochastic background analysis 

According to various cosmological scenarios, we are 
bathed in a stochastic background of gravitational waves, 
memory of the first instant of the Universe, up to the lim- 
its of the Planck era and the Big Bang, and often seen as 
the Grail of GW astronomy. Proposed theoretical models 
include the amplification of vacuum fiuctuations during 
infiation, pre-Big-Bang models, cosmic strings or phase 
transitions (see [18l [48] ) . In addition to the cosmological 
background (CGB), an astrophysical contribution (AGB) 
may have resulted from the superposition of a large num- 
ber of unresolved sources since the beginning of stellar 
activity ^56j. In the range of terrestrial detectors (up to 
/ ^ 1 kHz) the AGB is expected to be dominated by 
the cosmological population of compact binaries, in par- 
ticular BNS, and could be a noise that would mask the 
background of cosmological origin. 

In this paper, we assume that the three ET detec- 
tors were independent and thus had no common (cor- 
related) noise. A crucial prerequisite to searching ET 
data for stochastic GW will be to identify and remove 
environmental noise that can corrupt the result of cross- 
correlation analysis with co- located detectors. Relevant 
methods are under development for the two co-aligned 
and CO- located LIGO Hanford detectors [78 . 

One of the most important future tasks will be to sub- 
tract the astrophysical contribution in order to allow de- 
tection of the primordial background. This could be done 
either in the frequency domain by modeling the power 
spectrum with high accuracy from theoretical studies, or 
characterizing its shape using Bayesian analysis of the 
data [77 , or in the time domain by removing individual 
sources as previously discussed. 

The nature of the AGB may also differ from its cosmo- 
logical counterpart, which is expected to be stationary, 
unpolarized, gaussian and isotropic, by analogy with the 
cosmic microwave background. On the one hand, the 
distribution of galaxies up to 100 Mpc is not isotropic 
but strongly concentrated in the direction of the VIRGO 
cluster and the Great Attractor, and on the other hand, 
depending whether the time interval between events is 
short compared to the duration of a single event, the in- 
tegrated signal may result in a continuous, popcorn noise 
or shot noise background [17 . 



In this paper we used the standard cross-correlation 
method for detection of stochastic GW background, but 
new techniques exist or are under development in the 
LIGO/Virgo community to search for non-isotropic [SO', 
^ST\ or non-gaussian stochastic backgrounds [8^ ^ , and 
they will be tested in future challenges. 

Finally, the astrophysical background is not only a 
noise but it could carry crucial information about the 
star formation history, the metallicity, the mass range of 
neutron star and black hole progenitors, their physical 
properties, the rate of compact binary mergers: devel- 
oping methods for parameter estimation will represent 
another important task in future challenges. 



VI. CONCLUSION 

We have described the generation and first analyses 
of a mock data set for the proposed Einstein Telescope 
gravitational- wave observatory, containing a population 
of binary neutron star (BNS) inspiral signals at cosmolog- 
ical distances. Our motivation for this MDC is both for 
data analysis, to consider the different challenges encoun- 
tered for data containing frequent and strong signals, and 
to emphasize science challenges in relating the results of 
data analysis to outstanding questions in fundamental 
physics, astrophysics and cosmology [18 . 

The challenge carried out in this paper is, in many 
ways, similar to the Mock LISA Data Challenge [77 , but 
there are some important technical differences. In the 
case of LISA, the data analysis problem is not CPU or 
memory intensive. Even year long signals at a frequency 
of 10~^ Hz have only tens of thousands of samples. In the 
case of ET, however, CPU and memory limit what prob- 
lems current algorithms are able to address. With the 
software and computer infrastructure that is presently 
available, it is impossible to address the problem of ET 
data analysis to the fullest extent. For example, a binary 
neutron star starting at 1 Hz will last for about 5 days 
and there is no way to filter such long signals with the 
matched filtering algorithms accessible to us. It is neces- 
sary to explore and develop new search algorithms which 
don't require the entire template to be available at once 
to carry out a search. More importantly, future MDCs 
focus on the challenge of extracting useful science from 
ET, not just extraction of GW signals. 

The design topology of the Einstein Telescope allows 
the construction of a unique null stream [53 independent 
of the sky position. We have demonstrated that it is pos- 
sible to recover the average spectrum of the GW signals 
by subtracting the "pure noise" power spectral density 
(PSD) obtained from the null stream, from the PSD in 
each individual detector. The recovered "residual" PSD 
has a power-law character extremely close to the f~^^^ 
behaviour expected for inspiraling binary systems. The 
residual PSD can either be used as a diagnosis tool for 
future Mock Data Challenges and stochastic analyses, or 
as a research tool complementary to a more traditional 
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stochastic analysis. 

The nuh stream is also expected to be a powerful tool 
for identifying and vetoing non- Gaussian features in the 
detector outputs; however, since the current set of ET 
mock data does not include such noise features, this use 
of the null stream will be a topic for future investigation. 

The analysis used to detect coalescing binary signals 
was similar to current pipelines employed in searching 
LIGO- Virgo data, and was able to recover a large frac- 
tion of simulated signals at redshifts approaching unity. 
Some signals were recovered up to redshifts greater than 
3 with good (< 1%) accuracy on chirp mass (the chief 
parameter determining the frequency evolution of inspi- 
ral signals). Overlap between two or more signals only 
rarely affected the performance of the analysis; however 
this could become a more critical issue if the lower fre- 
quency cutoff (taken to be 25 Hz for the first CBC anal- 
ysis) were reduced. 

We also searched for the GW background created by 
the superposition of all the binary inspiral signals up to 
a redshift of z ^ 6 using the standard cross-correlation 
statistic, considering the frequency range 10 — 500 Hz 
where the spectrum can be well approximated by a power 
law ^gw{f) /^/^. Our point estimate at 100 Hz is 
in good agreement with the analytical expectation (with 
a precision better than ~ 5%), and our analysis shows 
that non-Gaussian regimes can be probed by the stan- 
dard cross-correlation statistics near optimal sensitivity, 
confirming the work of [82^. 

Future mock data will include a wider range of sig- 
nals, encompassing CBC signals from BNS, NSBH/BBH, 
IMBH systems; a possible primordial stochastic back- 
ground; and rare burst-like signals such as core-collapse 
supernovae. The challenge will be not only to detect 
these signals but to measure their parameters, and ulti- 



mately to extract a unique range of information about 
astrophysics, fundamental physics and cosmology from 
the data. 

Information on future challenges, and on how to par- 
ticipate will be posted on the ET MDC website ht tp : | 
//www . oca . eu/regiinbau/ET-MDC_web/ET-MDC . html, 
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